OpenMP parallelization in the NFFT software library

نویسنده

  • Toni Volkmer
چکیده

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Support for Thread-Level Speculation into OpenMP

– In-depth knowledge of the problem. – Understanding of the underlying architecture. – Knowledge on the parallel programming model. • OpenMP allows to parallelize code “avoiding” these requirements. • Compilers’ automatic parallelization only proceed when there is no risk. • Thread-Level Speculation (TLS) can extract parallelism when a compile-time dependence analysis can not guarantee that the...

متن کامل

Tile Reduction: The First Step towards Tile Aware Parallelization in OpenMP

Tiling is widely used by compilers and programmer to optimize scientific and engineering code for better performance. Many parallel programming languages support tile/tiling directly through first-class language constructs or library routines. However, the current OpenMP programming language is tile oblivious, although it is the de facto standard for writing parallel programs on shared memory s...

متن کامل

Keynote 4: Can Parallel Software Catch up with Parallel Hardware? Trends in Automatic Parallelization

upercomputers have to be proved powerful for various fields including the development of advanced technologies such as large-scale scientific and engineering computing, new material manufacture, nuclear fusion simulation, and automotive design. On October 20, 2004, NEC Corporation announced the availability of their new supercomputer ‘SX-8’, the world’s most powerful vector supercomputer with a...

متن کامل

A high-performance face detection system using OpenMP

We present the development of a novel high-performance face detection system using a neural network-based classification algorithm and an efficient parallelization with OpenMP. We discuss the design of the system in detail along with experimental assessment. Our parallelization strategy starts with one level of threads and moves to the exploitation of nested parallel regions in order to further...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012